Information-Theoretic Active SOM for Improving Generalization Performance
نویسنده
چکیده
In this paper, we introduce a new type of information-theoretic method called “information-theoretic active SOM”, based on the self-organizing maps (SOM) for training multi-layered neural networks. The SOM is one of the most important techniques in unsupervised learning. However, SOM knowledge is sometimes ambiguous and cannot be easily interpreted. Thus, we introduce the information-theoretic method to produce clearer and interpretable representations. The present method extends this information-theoretic approach into supervised learning. The main contribution can be summarized by three points. First, it is shown that clear representations by the information-theoretic method can be effective in training supervised learning. Second, the method is sufficiently simple where there are two separated components, namely, information maximization and error minimization component. Usually, two components are mixed in one framework, and it is difficult to compromise between them. In addition, the knowledge obtained by this information-theoretic SOM can be used to solve the shortage of unlabeled data, because the information maximization component is unsupervised and can process all input data with and without labels. The method was applied to the well-known image segmentation datasets. Experimental results showed that clear weights were produced and generalization performance was improved by using the information-theoretic SOM. In addition, the final results were stable, almost independent of the parameter values. Keywords—SOM; Labeled and Unlabeled; Supervised and Unsupervised; Generalization; Interpretation
منابع مشابه
Supervised Potentiality Actualization Learning for Improving Generalization Performance
The present paper proposes an application of potentiality learning to supervised learning. The potentiality has been developed as a measure of the importance of components in the self-organizing maps (SOM) to extract important input neurons. The main characteristics lies in its simplicity and thus it can be easily implemented. If it is possible to use it for conventional supervised learning, be...
متن کاملAn Information-Theoretic Discussion of Convolutional Bottleneck Features for Robust Speech Recognition
Convolutional Neural Networks (CNNs) have been shown their performance in speech recognition systems for extracting features, and also acoustic modeling. In addition, CNNs have been used for robust speech recognition and competitive results have been reported. Convolutive Bottleneck Network (CBN) is a kind of CNNs which has a bottleneck layer among its fully connected layers. The bottleneck fea...
متن کاملTexture Classification by Combining Local Binary Pattern Features and a Self-Organizing Map
This paper deals with the combined use of Local Binary Pattern (LBP) features and a Self-Organizing Map (SOM) in texture classification. With this approach, the unsupervised learning and visualization capabilities of a SOM are utilized with highly efficient histogram-based texture features. In addition to the Euclidean distance normally used with a SOM, an information theoretic log-likelihood (...
متن کاملNGTSOM: A Novel Data Clustering Algorithm Based on Game Theoretic and Self- Organizing Map
Identifying clusters is an important aspect of data analysis. This paper proposes a noveldata clustering algorithm to increase the clustering accuracy. A novel game theoretic self-organizingmap (NGTSOM ) and neural gas (NG) are used in combination with Competitive Hebbian Learning(CHL) to improve the quality of the map and provide a better vector quantization (VQ) for clusteringdata. Different ...
متن کاملA Continuous Self-Organizing Map using Spline Technique for Function Approximation
We propose a new method called C-SOM for function approximation. C-SOM extends the standard Self-Organizing Map (SOM) with a combination of Local Linear Mapping (LLM) and cubic spline based interpolation techniques to improve standard SOMs' generalization capabilities. CSOM uses the gradient information provided by the LLM technique to compute a cubic spline interpolation in the input space bet...
متن کامل